µ-Fractal Based Data Perturbation Algorithm For Privacy Protection
Abstract
Many organizations publish anonymized medical data for sociological research, health research, education, and other useful studies. Although attributes that clearly identify individuals, such as names and personal identity numbers, are removed, combinations of other information, such as date of birth, gender, and postcode, can still be used to identify an individual. Existing data perturbation techniques can de-identify the data before publishing, but they make the process irreversible, so the original data cannot be fully recovered. A major issue is how to maintain the usability and utility of privacy-protected data while also making the published data restorable for authorized users. In this paper, we propose a novel, robust data perturbation algorithm that can withstand brute-force attacks, while the perturbed data pattern is indistinguishable from the original data pattern. A distinguishing feature of our method is that, by using fractal theory to derive perturbation vectors, it provides high privacy protection together with fully reversible data perturbation while maintaining maximal data utility. Experiments on practical data confirm the desired operation of our data perturbation algorithm and its effectiveness. The experimental results lead us to conclude that the proposed approach computationally resists brute-force attacks and maintains the same data distribution type as the original data.
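The idea of a key-dependent, fully reversible perturbation can be illustrated with a minimal sketch. This is not the algorithm proposed in the paper, whose fractal construction is not given in the abstract. As a stand-in, the sketch assumes that integer offsets are derived from a secret seed with the logistic map (a simple chaotic map) and applied with modular addition, so that an authorized user holding the seed can restore the data exactly. The function names, the parameter `r`, and the burn-in length are all hypothetical, and the logistic map is not a cryptographically secure generator.

```python
def logistic_offsets(key: float, n: int, modulus: int,
                     r: float = 3.99, burn_in: int = 100) -> list[int]:
    """Derive n integer offsets in [0, modulus) from a secret seed.

    Assumption: the logistic map x -> r*x*(1-x) stands in for the
    paper's fractal-derived perturbation vectors.
    """
    if not 0.0 < key < 1.0:
        raise ValueError("key must lie strictly between 0 and 1")
    x = key
    # Discard early iterations so the offsets depend less visibly on the seed.
    for _ in range(burn_in):
        x = r * x * (1.0 - x)
    offsets = []
    for _ in range(n):
        x = r * x * (1.0 - x)
        offsets.append(int(x * modulus) % modulus)
    return offsets


def perturb(values: list[int], key: float, modulus: int) -> list[int]:
    """Shift each value by its key-derived offset, wrapping at modulus."""
    offsets = logistic_offsets(key, len(values), modulus)
    return [(v + o) % modulus for v, o in zip(values, offsets)]


def restore(perturbed: list[int], key: float, modulus: int) -> list[int]:
    """Undo perturb() by regenerating the same offsets from the key."""
    offsets = logistic_offsets(key, len(perturbed), modulus)
    return [(p - o) % modulus for p, o in zip(perturbed, offsets)]


if __name__ == "__main__":
    ages = [34, 71, 5, 119]          # hypothetical attribute values in [0, 120)
    secret = 0.3141592               # hypothetical secret seed
    published = perturb(ages, secret, 120)
    assert restore(published, secret, 120) == ages
```

Because the offsets are applied with integer modular arithmetic, restoration is exact for any values in the range 0 to modulus - 1. The sketch makes no attempt to preserve the data distribution or to resist brute-force attacks, both of which the abstract claims for the proposed method.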
Similar resources
A New Algorithm-independent Method for Privacy-Preserving Classification Based on Sample Generation
With the development of data mining technologies, privacy protection is becoming a challenge for data mining applications in many fields. To solve this problem, many PPDM (privacy-preserving data mining) methods have been proposed. One important type of PPDM method is based on data perturbation. Only part of the data-perturbation-based methods is algorithm-irrelevant, which are favorable becaus...
Modified Privacy Preserving Data Mining System for Improved Performance
Privacy of information and security issues have nowadays become a requisite because of big data. Privacy Preserving Data Mining (PPDM) presents a novel framework for extracting and deriving information when the data is distributed among multiple parties. The concern of a PPDM system is to prevent the disclosure of information and its misuse. Major issue with PPDM that exists is to...
An Improved Privacy-Preserving Collaborative Filtering Recommendation Algorithm
Privacy-preserving collaborative filtering is an emerging web-adaptation tool to cope with the information overload problem without jeopardizing individuals’ privacy. However, collaborative filtering with privacy schemes commonly suffers from scalability and sparseness. Moreover, applying privacy measures causes a distortion in collected data, which in turn degrades the accuracy of such systems. In this...
Feature Selection: A Preprocess for Data Perturbation
As a major concern in designing various data mining applications, privacy preservation has become a critical component seeking a trade-off between mining performance and protecting sensitive information. Data perturbation or distortion is a widely used approach for privacy protection. Many privacy preservation approaches were developed, either by adding noise or by matrix decomposition method...
Analyzing Tools and Algorithms for Privacy Protection and Data Security in Social Networks
The purpose of this research is to study factors influencing privacy concerns about data security and protection on social network sites and their influence on self-disclosure. 100 articles about privacy protection, data security, information disclosure and information leakage on social networks were studied. Model and algorithm types and their repetition in articles have been distinguished a...
Publication date: 2012